Fuzzy and Rough Set Theory Based Gene Selection Method

نویسندگان

  • C. Kalaiselvi
  • G. M. Nasira
چکیده

The selection of genes from microarray gene expression datasets has become an important research in cancer classification because such data typically consist of a large number of genes and a small number of samples. In this work, Neighborhood mutual information is retrieved to evaluate the relevance between genes and is used to stop information loss. Firstly, an improved Relief Feature Selection algorithm is proposed to create candidate subsets of features. Based on the neighborhood mutual information the cohesion degree of neighborhood object and coupling degree between neighborhood objects have been defined. Furthermore, a new method of initialization for cluster centers in Fuzzy C-means (FCM) algorithm is proposed. FCM allows one piece of data that belong to two or more clusters. Neighborhood rough set is used for extraction and selection of features and is used in proposed FCM algorithm. Finally, to find the performance of the proposed approach, five gene expression datasets were taken. Experimental results show that the proposed approach can select genes effectively, and can obtain high and stable classification performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets

With the advancement of metagenome data mining science has become focused on microarrays. Microarrays are datasets with a large number of genes that are usually irrelevant to the output class; hence, the process of gene selection or feature selection is essential. So, it follows that you can remove redundant genes and increase the speed and accuracy of classification. After applying the gene se...

متن کامل

A hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts

High dimensional microarray datasets are difficult to classify since they have many features with small number ofinstances and imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improvethe classification performance of microarray datasets by selecting the significant features. Combining the concepts ofrough sets, weighted rough set, fuzzy rough se...

متن کامل

T-Rough Sets Based on the Lattices

The aim of this paper is to introduce and study set- valued homomorphism on lattices and T-rough lattice with respect to a sublattice. This paper deals with T-rough set approach on the lattice theory. The result of this study contributes to, T-rough fuzzy set and approximation theory and proved in several papers. Keywords: approximation space; lattice; prime ideal; rough ideal; T-rough set; set...

متن کامل

Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification

Tumor classification based on gene expression levels is important for tumor diagnosis. Since tumor data in gene expression contain thousands of attributes, attribute selection for tumor data in gene expression becomes a key point for tumor classification. Inspired by the concept of gain ratio in decision tree theory, an attribute selection method based on fuzzy gain ratio under the framework of...

متن کامل

On fuzzy-rough attribute selection: Criteria of Max-Dependency, Max-Relevance, Min-Redundancy, and Max-Significance

Attribute selection is one of the important problems encountered in pattern recognition, machine learning, data mining, and bioinformatics. It refers to the problem of selecting those input attributes or features that are most effective to predict the sample categories. In this regard, rough set theory has been shown to be successful for selecting relevant and nonredundant attributes from a giv...

متن کامل

Comparing Fuzzy-Rough and Fuzzy Entropy-assisted Fuzzy-Rough Feature Selection

Feature Selection (FS) methods based on fuzzy-rough set theory (FRFS) have employed the dependency function to guide the FS process with much success. More recently a method has been developed which uses fuzzy-entropy [9] to perform this task. Such use of fuzzy-entropy as an evaluation measure in fuzzy-rough feature selection can result in smaller subset sizes than those obtained through FRFS a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014